A Voice Production Model for Waveform Coding Speech
نویسنده
چکیده
Voice production models are used for speech compression thus reducing the encoding data rates and storage space for speech signaling processing. The legacy voice production model consists of a vocal source model and a vocal tract model. This paper presents a simpler voice production model adapted to waveform coding of speech: the Mathieu Waveform Coder (MWC) model. The MWC model yields weighted basis functions to waveform code the speech signal. Each Mathieu basis function represents an infinite series of sinusoids rather than a single sinusoidal spectral line. The research motivation is to determine if waveform coding speech using an elliptical membrane model (i.e., the Mathieu basis functions) is more efficient than coding speech using a circular membrane model (i.e., the sine basis functions) . Preliminary results indicate that waveform coding using the MWC model provides a superior estimate of the speech signal compared to coding with an equal number of sinusoids.
منابع مشابه
Low-bit-rate Speech Coding
Low-bit-rate speech coding, at rates below 4 kb/s, is needed for both communication and voice storage applications. At such low rates, full encoding of the speech waveform is not possible; therefore, low-rate coders rely instead on parametric models to represent only the most perceptually-relevant aspects of speech. While there are a number of different approaches for this modeling, all can be ...
متن کاملGlottal closure and opening detection for flexible parametric voice coding
The knowledge of glottal closure and opening instants (GCI/GOI) is useful for many speech analysis applications. A Pitchsynchronous waveform encoding of voice is one such application. In this paper, a dynamic programming is employed to solve for the global close/open phase segmentation based on the polynomial parametric waveform of the derivative glottal waveform and its quasi-periodicity. Not ...
متن کاملA new 2-kbit/s speech coder based on normalized pitch waveform
Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is di cult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. List...
متن کاملA study on the recognition of low bit-rate encoded speech
Digital speech communications are the future trend in the Internet and mobile phones. The low bit-rate coding of speech signals is the essential requirement in the concern of channel bandwidth and transmission efficiency. The voice-based services will become more attractive to the service providers. Many voice-driven applications require that users must be authorized and able to be identified. ...
متن کاملChapter 10 Transmission and Storage 10.1 Overview
Until the late seventies, research in speech compression followed two di erent directions: vocoders (abbreviation of voice coders) and waveform coders. The two approaches substantially di er in their underlying principles and performance. Whereas the rst explore our knowledge of speech production, attempting to represent the signal spectral envelope in terms of a small number of slowly varying ...
متن کامل